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Abstract 

We present a general theory of series-parallel mental architectures with selectively influenced stochastically non- 
independent components. A mental architecture is a hypothetical network of processes aimed at performing a task, 
of which we only observe the overall time it takes under variable parameters of the task. It is usually assumed that 
the network contains several processes selectively influenced by different experimental factors, and then the question 
is asked as to how these processes are arranged within the network, e.g., whether they are concurrent or sequential. 
One way of doing this is to consider the distribution functions for the overall processing time and compute certain 
linear combinations thereof (interaction contrasts). The theory of selective influences in psychology can be viewed 
as a special application of the interdisciplinary theory of (non)contextuality having its origins and main applications 
in quantum theory. In particular, lack of contextuality is equivalent to the existence of a “hidden” random entity of 
which all the random variables in play are functions. Consequently, for any given value of this common random 
entity, the processing times and their compositions (minima, maxima, or sums) become deterministic quantities. 
These quantities, in turn, can be treated as random variables with (shifted) Heaviside distribution functions, for 
which one can easily compute various linear combinations across different treatments, including interaction contrasts. 
This mathematical fact leads to a simple method, more general than the previously used ones, to investigate and 
characterize the interaction contrast for different types of series-parallel architectures. 

KEYWORDS: interaction contrast, mental architectures, noncontextuality, response time, selective influences, 
series-parallel network. 
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1 Introduction 

The notion of a network of mental processes with components selectively influenced by different experimental factors 
was introduced to psychology in Saul Sternberg’s (1969) influential paper. Sternberg considered networks of processes 
a,b,c,... involved in performing a mental task. Denoting their respective durations by A.B.C ..., the hypothesis he 
considered was that the observed response time T is A+B+C + -... One cannot test this hypothesis, Sternberg wrote, 
without assuming that there are some factors, a, (3, y,..that selectively influence the durations A, B,C ..., respectively. 
Sternberg’s analysis was confined to stochastically independent A,B,C ,..., and the consequences of the assumptions 
of seriality and selective influences were tested on the level of the mean response times only. 

Subsequent development of these ideas was aimed at the entire distributions of the response times and at a greater 
diversity and complexity of mental architectures than just series of “stages.” This development prominently includes 
Roberts and Sternberg (1993), Schweickert, Giorgini, and Dzhafarov (2000), Schweickert and Townsend (1989), 
Townsend (1984, 1990a, 1990b), Townsend and Nozawa (1995), Townsend and Schweickert (1989), and several 
other publications, primarily by James Townsend and Richard Schweickert with colleagues. For an overview of 
these developments see Dzhafarov (2003) and Schweickert, Fisher, and Sung (2012). In the present context we should 
separately mention the development of the ideas of stochastic ordering of processing times in Townsend (1984,1990a) 
and Townsend and Schweickert (1989); as well as the idea of marginal selectivity (Townsend & Schweickert, 1989). 

The notion of selective influences also underwent a significant development, having been generalized from stochas¬ 
tically independent random variables to arbitrarily interdependent ones (Dzhafarov, 2003; Dzhafarov & Gluhovsky, 
2006; Kujala & Dzhafarov, 2008; Dzhafarov & Kujala, 2010; in press). The essence of the development is easy to 
understand using two random variables (e.g., process durations) A. B selectively influenced by two respective factors 
a, (3. In Dzhafarov’s (2003) notation, this is written (A. B) H 3 (a, p). According to the definition given in Dzhafarov 
(2003), this means that there are functions / and g and a random variable R (a common source of randomness) such 
that /(a,R) = A and g(P,R) = B. If such a choice of ( f,g,R ) exists, it is not unique. For instance, R can always 
be chosen to have any distribution that is absolutely continuous with respect to the usual Borel measure on the real 
line (e.g., a standard uniform, or standard normal distribution, see Dzhafarov & Gluhovsky, 2006). However, a triple 
(f,g,R) need not exist. It does not exist, e.g., if marginal selectivity (Townsend & Schweickert, 1989) is violated, 
i.e., if the distribution of, say, A at a given value of a changes in response to changing p. But marginal selectivity is 
not sufficient for the existence of a triple (f,g,R). Let, e.g., a and P be binary factors, with values 1,2 each, and let 
the correlation p between A and B for a treatment (a, p) be denoted p a p. Then the triple in question does not exist 
if the correlations violated the “cosphericity test” (Kujala & Dzhafarov, 2008), also known in quantum mechanics as 
Landau’s inequality (Landau, 1989): 


|PllPl2 — P2lP22| < PllPl2 + P2lP22> 


( 1 ) 


where p a p = y I — pjy. There are many other known conditions that must be satisfied for the existence of a triple 
(f,g,R) when marginal selectivity is satisfied (Dzhafarov & Kujala, 2010, 2012a,b, 2013, 2014a). 

The allusion to quantum mechanics is not accidental: as shown in Dzhafarov and Kujala (2012a,b), the theory of 
selective influences in psychology can be viewed as a special application of the theory of (non)contextuality. This 
theory is interdisciplinary (Dzhafarov & Kujala, 2014b-d; Khrennikov, 2009), but its origins are in quantum theory, 
dating from Kochen and Specker (1967) and John Bell’s (1964, 1966) celebrated work. For the modern state of the 
theory see Dzhafarov, Kujala, and Larsson (2015). A simplified account of the (non)contextuality analysis of the 
example given above is as follows. One labels each random variable in play contextually, i.e., by what property is 
being measured/recorded under what treatment (context): 


/ 


Rvalue of a 

yproperty: what is measured J 


\ (value of a, value of P) / 

-- ' 


context: under what treatement 




^value of (3 

property: what is measured// 


(value of a, value of P) 


context: under what treatement 


( 2 ) 


The notation here is, of course, redundant, because the context and property identifiers overlap, but we need now to 
emphasize the logic rather than achieve notational convenience. Once the labeling is done, one looks at all possible 
joint distributions imposable on all these random variables, for all properties and all treatments. A system is noncon- 
textual if there exists such a joint distributions in which any two random variables that represent the same property 
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(“what is measured”) are equal with probability 1. The latter is possible only if the random variables representing the 
same property always have the same distribution: in our case 

(A„) (a ' p) ~ (A«)(“-P'), (fi p ) (a ’ p) ~ 13 ) (3) 

for any values a, PjOt'jp' of the two factors. This is called consistent connectedness (Dzhafarov, Kujala, & Larsson, 
2015), and in physics is known under a variety of names, including (in certain paradigms) “no-signaling condition” 
(Cereceda, 2000; Masanes, Acin, & Gisin, 2006; Popescu & Rohrlich, 1994). In psychology, this is marginal selec¬ 
tivity. The definition of noncontextuality just given is not the most general one, as the notion of contextuality can be 
extended to inconsistently connected (violating marginal selectivity) systems (Dzhafarov, Kujala, & Larsson, 2015), 
but we do not need this generality in this paper. What is important for us here is that the existence of a joint distribution 
mentioned in our definition is equivalent to the existence of a random variable R and the functions f,g mentioned in 
the introductory paragraph. 

It is easy to show (Dzhafarov, 2003) that the existence of a triple (f,g,R) for given joint distributions of (A. B) 
under different treatments (a, (3) is equivalent to the existence of a quintuple (/' ,g' ,S,Sa,Sb), where S.Sa , 5/; are 
random variables, such that f (a, S, Sa) =A and g' (fi, S, Sb) =B. In such a representation, one can speak of a common 
source of randomness S and specific sources of randomness Sa,Sb ■ In Dzhafarov, Schweickert, and Sung (2004) this 
representation was used to investigate different series-parallel arrangements of the hypothetical durations A and B. The 
reason this representation has been considered convenient is that if one fixes the value S = s, then f (o,,s. Sa ) = A c 
and g' ($,s,Sb) = B, are stochastically independent random variables. One can therefore use theorems proved for 
stochastically independent selectively influenced components (Schweickert, Giorgini, & Dzhafarov, 2000) to obtain a 
general result by averaging across possible values of s. For instance, let a, (3 be binary factors (with values 1,2 each), 
and let us assume that the observed duration r a p is min (A a ,B p) for every treatment (a, (3). Then T^p^, = min (A asi B p v ) 
for every value S = s, and it is known that, for the independent A asi B p v (satisfying a prolongation condition, as 
explained below), 

Pr(7j u <f)-Pr(r 12 ,<f)-Pr(72i,<f)-1-Pr(72 2 ,<t)<0. (4) 

Since this should be true for every value S = s, then it should also be true that 

C(0=Pr(7h <0—Pr(7j 2 </)-Pr(Z2i < t)+ Pr(T 22 <t) <0. (5) 


This follows from the fact that 

Pr (T a p < t) = j Pr (r a p, < /) dm (s), (6) 

where m (s) is the probability measure for S, and the integration is over the space of all possible s. The linear combi¬ 
nation C(t) in ([3 is called the interaction contrast of distributions functions. 

The Prolongation Assumption used in Dzhafarov et al. (2004), and derived from Townsend (1984, 1990a) and 
Townsend and Schweickert (1989), is that, for every S = s, 

Pr (A i. s < t) > Pr (A 2j < t) , Pr (B\ s <t)> Pr (B 2s < t ) . (7) 


For this particular architecture, T = min (A, B), this is the only assumption needed. To prove analogous results for more 
complex mental architectures, however, one needs additional assumptions, such as the existence of density functions 
for A as ,Z?p s at every ,v, and even certain ordering of these density functions in some vicinity [0, x]. 

The same results, however, can be obtained without these additional assumptions, if one adopts the other, equiva¬ 
lent definition of selective influences: f(a,R) = A and g ((3,/?) = B, for some triple (f,g,R). If such a representation 
exists, then 

aar = f(u,r),bp r = g{$,r) (8) 

are deterministic quantities (real numbers), for every value R = r. Any real number v in turn can be viewed as a random 
variable whose distribution function is a shifted Heaviside function 


h (t — x) = 


0, ift < x , 
1, ift > x. 


(9) 


In particular, the quantity t a p r = min (a ar .b^ r 'j for the simple architecture T 
tributed according to 

h{t-t a $r) =ha$r(t). 


min(A,B) considered above is dis- 

( 10 ) 
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Let us see how inequality (0 can be derived using these observations. 

We first formulate the (conditional) Prolongation Assumption , a deterministic version of 0: the assumption is 
that fjg,R can be so chosen that for every R = r, 

air < air, b\r < b 2r - (11) 

Without loss of generality, we can also assume, for any given r, 

a\ r <b\r (12) 

(if not, rename a into b and vice versa). 

Remark 1.1. The prolongation Assumption clearly implies 0. Conversely, if 0 holds, one can always find functions 
f,g,R for which the Prolongation Assumption holds in the form above. For instance, one can choose R = (S,Sa,Sb), 
take Sa and Sb to be uniformly distributed between 0 and 1, and choose / (a,. ..) ,g (p... .) to be the quantile functions 
for the hypothetical distributions of A and B at the corresponding factor levels. 

We next form the conditional (i.e., conditioned on R = r) interaction contrast 

Cr{t) = hnr(t)-hl2r{t)-h2lr(t)+h 2 2r{t). (13) 

Notation Convention. When r is fixed throughout a discussion, we omit this argument and write a a . Z?p , f a p .h a ^(t).c (t) 
in place of a ar ,Z?p n f a p r ,/! a p,.(f),c,-(f). (For binary factors a, (3, we also conveniently replace a, (3 in indexation with 

ij-) 

Following this convention, there are three different arrangements of a\ .a 2 -b\,b 2 (for a given R = r) satisfying 

CD O: 

(i) a\ <b\ <a 2 < b 2 

(ii) a\ <a 2 <bi <b 2 (14) 

(ill) a\ <b\ <b 2 < a 2 

In all three cases, 


For arrangement (i) we have 


t\\ = min(ai,^i) =ci\= min(ai,bi) = h 2 . 


(15) 


+^n (t) = 1 
—h\ 2 (t) = -1 

—/z 2 i ( t ) = -0 

+h 22 (f)=0 
= c(0 =0 


+hn (0 = 1 

-hn (0 = -l 
-/ z 2 l (0 = -1 
+h 22 (0=0 
= c (0 = -1 


tu=ti 2 =ai <1< t 2 i=bi <t < t 22 = a 2 

This diagram shows the values of /z, ; > ( t) and the resulting values of c, (t ) as t changes with respect to the fixed positions 
of tij r (with index r dropped everywhere). Analogously, for arrangements (ii) and (iii), we have, respectively 

c (0 = 0 

tn=ti 2 =ai <f< t 2 \ — t 22 — a 2 


and 


•(0 = o 


: (f) <0 


t 2 i = b i 


hi = 1 12 =a i 


< t < 


<t< 


t 2 2 = b 2 
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In all three cases, c(f) is obviously zero for t < t\\ and t > tji- We see that c(t) = c r (t) < 0 for all t and every R = r. 
It follows that C (t ) < 0, because 

Pr (Tij<t) = j h ijr (t)dp(r), (16) 

for i,j £ {1,2}, and 

C(t)= Jc r (t)dp(r)<0, (17) 

where p is the probability measure associated with R and the integration is over all possible r. We obtain the same 
result as in ([5]), but in a very different way. 

In this paper we extend this approach to other mental architectures belonging to the class of series-parallel net¬ 
works, those involving other composition operations and possibly more than just two selectively influenced processes. 
In doing so we follow a long trail of work mentioned earlier. When dealing with multiple processes we follow Yang, 
Fific, and Townsend (2013) in using high-order interaction contrasts. All our results are replications or straightforward 
generalizations of the results already known: the primary value of our work therefore is not in characterizing mental 
architectures, but rather in demonstrating a new theoretical approach and a new proof technique. 

1.1 Definitions, Terminology, and Notation 

Since we deal with the durations of processes rather than the processes themselves, we use the term composition 
to describe a function that relates the durations of the components of a network to the overall (observed) duration. 
Formally, a composition is a real-valued function t = t (a,b,... ,z) of an arbitrary number of real-valued arguments. 
The arguments a,b,...,z are referred to as durations or components. In this article, we will use X AY A ... A / to 
denote min(Y,T,... ,Z), and X VY V ... VZ to denote max(X,T... ,Z). 

A series-parallel composition (SP) is defined as follows. 

Definition 1.2. (1) A single duration is an SP composition. (2) If X and Y are SP compositions with disjoint sets of 
arguments, then X A Y, X V Y, and X +Y are SP compositions. (3) There are no other SP compositions than those 
construable by Rules 1 and 2. 

Remark 1.3. The requirement that X and Y in Rule 2 have disjoint sets of arguments prevents expressions like X AX 
or X +XV Y. But if the second X inXAI is renamed into X 1 . or X X Y in X + X V Y is renamed into Z, then the 
resulting X AX' and X + Z are legitimate SP compositions. This follows from the generality of our treatment, in which 
different components of an SP composition may have arbitrary joint distributions: e.g., X and X' in X AX' may very 
well be jointly distributed so that Pr [X = X'] = 1. One should, however, always keep in minds the pattern of selective 
influences: thus, if X is influenced by a, then Z is also influenced by a in X + Z above. 

Any SP composition is obtained by a successive application of Rules 1 and 2 (the sequence being generally 
non-unique), and at any intermediate stage of such a sequence we also have an SP composition that we can term a 
subcomposition. 

Definition 1.4. Two durations X,Y in an SP composition are said to be parallel or concurrent if there is a sub¬ 
composition of this SP composition of the form SP 1 (X,X \...) A SP 2 ( Y,Y ',...) (in which case X,Y are said to be 
min-parallel) or SP 1 (X,X ',...) V SP 2 (Y,Y , I ...) ( X,Y are max-parallel). X,Y in an SP composition are said to be 
sequential or serial if there is a subcomposition of this SP composition of the form SP 1 (X, X \...) + SP 2 (Y, Y ',...). 

Definition 1.5. An SP composition is called homogeneous if it does not contain both A and V in it; if it does not 
contain A, it is denoted SPv; if it does not contain V, it is denoted SPa- 

The only SP composition that is both SPa and SPv is a purely serial one: a + b + ... + z. Most of the results 
previously obtained for mental networks are confined to homogeneous compositions. We will not need this constraint 
for the most part. 

Since we will be dealing with compositions of more than just two components, we need to extend the definition 
of selective influences mentioned above. In the formulation below, ~ stands for “has the same distribution as.” A 
treatment <> = (A^ . • • ■ > K'„) is a vector of values of the factors A. 1 ,..., A.", the values of A k (k= 1..... /z) being indicated 
by subscripts, A^. 
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Definition 1.6. Random variables (X 1 ,... ,X n ) are selectively influenced by factors (A- 1 ,..., A,”), respectively, 

(X l ,...,X") <-P (AV.-.A"), (18) 

if for some random variable R, whose distribution does not depend on (A 1 ,... A"), and for some functions gi,... ,g„, 

(xl,...,x;)~( gl (XlR),...,g n (XZ,R)), (19) 

for any treatment (f) = ^A? t ,..., A? ''j. 

In the subsequent discussion we assume that all non-dummy factors involved are binary in a completely crossed 
design (i.e., the overall time T is recorded for all 2" vectors of values for (])). When we have random variables not 
influenced by any of these factors, we will say they selectively influenced by an empty set of factors (we could also, 
equivalently, introduce for them dummy factors, with one value each). 


2 SP Compositions Containing Two Selectively Influenced Processes 

Consider two processes, with durations A and B in an SP composition. The overall duration of this SP composition 
can be written as a function of A,B and other components: T = T(A,B ,...). We assume that A. B. and all other 
components are selectively influenced by a, (3, and empty set, respectively: (A.B... .) <-P (a, (3,0). Let each factor has 
two levels: a = 1,2 and (3 = 1,2, with four allowable treatments (1,1), (1,2), (2,1), and (2,2). The corresponding 
overall durations (random variables) are written as 7Ti, 7i2, T 21 , and T 2 2 . 

By Definition ! 1.6l of selective influences, each process duration (a random variable) is a function of some random 
variable R and the corresponding factor: A = a (c i,R), B = b( (3 . R). For any given value R = r , the component durations 
are fixed numbers, 

a (a = l,r) = ai r , a (a = 2,r) = < 22 ,-, 

b{$ = l,r)=b lr , b($ = 2,r)=b lr , (20) 

x(0,r) = x r , 

where x is the value of any duration X in the composition other than A and B. We assume that R is chosen so that the 
Prolongation Assumption (ITO holds, with the convention ( 1 1 21 . 

The overall duration T at R = r is also a fixed number, written as (recall that we replace a, (3 in indexation with 

i,j) 

T (a ir , P j r ,...) = tjj r , i, j e {1,2}. (21) 

The distribution function for f i; > is the shifted Heaviside function hij r (t) = h(t — f^>), 

*-1 (22) 


The conditional interaction contrast c r (t) is defined by (fl3l) . Denoting by H,j(t) the distribution function of 7) ; , we 
have 

H ij(t)= [ hij r (t)d/Jr, (23) 

with denoting the set of possible values of R. For the observable (i.e., estimable from data) interaction contrast 

C it) = H x \ (t) - H n (i t ) - H 2l (?) + H 22 ( t) , (24) 

we have then 


C(t) 




-(f) dq r . 


(25) 
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Note that it follows from our Prolongation Assumption that 

Hn (t) > H \2 (t) , #21 (0 > H 22 (t) , Hn(t) > H 21 (t), H \2 it) > H 22 (f) ■ 
We also define two conditional cumulative interaction contrasts (conditioned on R = r): 

) dx. 


c{0,t) = [ c(x)< 

Jo 

= / c(x)dx=lim / c(x)dx. 

Jt 


The corresponding observable cumulative interaction contrasts are 


C(f,°°) = / c(f,°°)d ( u,. = 

J%. 


\Jt 


•(x)dx 


( 26 ) 

(27) 

(28) 


c(x)d^,-^ dx = 

[ C(x)dx. 

IQ 

(29) 

c (x) dp ,dx = 

j C(x)dx. 

(30) 


In these formulas we could switch the order of integration by Fubini’s theorem, because, for any interval of reals I, 

f |c(x)|d(xx p r ) < f 2d(xx j u r ) <2. (31) 

JlxtR JlxtR 


2.1 Four lemmas 

Recall the definition of c r (t ) in ( 1 1 31 ). We follow our Notation Convention and drop the index r in c r (t ) and all other 
expressions for a fixed r. 

Lemma 2.1. In any SP architecture, for any r, 

til < fi 2 Af 21 < 1 12 V ?21 < ill- 

Proof Follows from the (nonstrict) monotonicity of the SP composition in all arguments. □ 

Lemma 2.2. In any SP architecture, for any r, c ( t ) equals Ofor all values oft except for two cases: 

(Case+) if tn < t < tn/\t 2 i, then c(t) = l— 0 — 0 + 0>0, 
and 

(Case - ) ifti 2 \/t 2 i < t < t 22 , then c(f) = 1 — 1 — l+0<0. 

Proof By direct computation. □ 

Lemma 2.3. In any SP architecture, for any r, c (t) < 0 for all values oft if and only ift\ 1 = /12 A ti\; c (t ) > 0 for all 
values oft if and only ift 12 V /21 = t 22 - 

Proof. Immediately follows from Lemma lZ2l □ 

Lemma 2.4. In any SP architecture, for any r, 

(i) c(0,t) = Jo c(x) dx > 0 for any t if and only if — 1 11 + 1 12 + ?2i — hi > 0, 

(ii) c(t,°°) = f t °° c(x)c/x < 0 for any t if and only if —t\ \ -\-t 12 +hi —hi < 0, 

(Hi) lim^ooC (0,f) = 0 if and only if —t\ \ +fi2 + (21 — hi = 0. 

(iv) lim r ^oc(t,°°) = 0 if and only if —fn -\-tn+hi - hi = 0. 
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Proof. Without loss of generality, put t\i < ?21 ■ We have 


ro 


c (0 ,t) = < 


(t-tu) 

(t — tn) — (t — tn) = ft2 — fit 
{t — tn) — (t — tn) — {t — ?2i) 

= -fit +fl2 + *21 — t 
(* — tu) — (t — tn) — (t — *2l) + {t — t2l) 
= —*11 + *12 +*21 —t22 


if t < tn 
if tn < t < t\2 
if t\2<t < t2\ 

if t2\ <t < t22 

if t > ?22 


The expressions for the first three cases are obviously nonnegative. If —t\ 1 + f 12 + *21 — *22 > 0, then c (O.t) > 0 for all 
t in the last case (t > t 2 i). With — tu +/12 + *21 ~t 22 > 0, we have — 1\\ + *12 + *21 — * > *22 — t > 0 for the fourth case 
(f>i < f < ? 22 )- Hence c(0,f) > 0 for all f if —tn +fi 2 + *21 — *22 > 0. Conversely, if c (0,/) >0 for all t, then it is also 
true for t > C 2 , whence -fn + *i 2 +* 2 i —*22 > 0. 

The proof for c(t,°°) = f t °° c (x) r/x requires replacing it first with c (x) dl < 0 for some u > ? 22 - We have 


if r < tn 
if *11 < * < /12 
if t\2<t < t2l 
if ?21 < t < t2 2 
if t > t22 

The expressions for the last three cases are obviously nonpositive. If —t\ \ +t 12 + *21 ~ *22 < 0, then f u c i2> (x) t/x < 0 
for all t in the first case ( t < fn). With — 1\\ + * 12+*21 — *22 < 0, we have — t + * 12+*21 — *22 < fit — * < 0 for the 
second case {tn < t < tn). Hence f u c(x)r/x < 0 for all t if — 1\ \ + *12 + *21 — *22 < 0 . Since in all expressions u is 
algebraically eliminated, they remain unchanged as u -2 °o. Conversely, if c (f,») <0 for all t, then it is also true for 
t < tu, whence -tu + *12 + *21 -*22 < 0. 

The statements (iii) and (iv) follow trivially. □ 




dl= < 


(« - *11) - (« - *12) - (w - *21) + (k - *22) 
= —t\\ + *12 + *21 —t22 
(u-t)- (w -*12) - (w-* 2l) + (m -* 2 2) 

= — t + *12+ *21 ~ *22 
(u — t) — (u—t) — (h —*2l) + (w —*22) 

= t 2 l ~t 22 

(u — t) — (u — t) — (u — t) + (u — t22) 

= t~t 22 

(u — t) — (u — t) — (it — t) + (m — t) 

= 0 


2.2 Parallel Processes 

2.2.1 Simple Parallel Architectures of Size 2 

A simple parallel architecture corresponds to one of the two compositions: T =A/\BoxT =A\/B, with (A. B) <-P 
(a, (3). Recall the definition of C (t) in (l24l . 

Theorem 2.5. ForT =AAB, wehavec(t) < 0 for any r,t; consequently, C (t) <0 for any t. ForT =A\/B, we have 
c (f) > 0 for any r,t; consequently, C (t) > 0 for any t. 

Proof. For T = A AB with the Prolongation Assumption ([TTl i-(IT2l>. we have 

fit =a\ Ab\ =a\,tn=a\Ab2,t2\ = 02 A b\. 


It follows that 


?12 A? 2 t = fli AfcjAfliAfei = a 1 = fn- 

By Lemma 12731 c{t) < 0. As C (f) in (1251 ) preserves the sign of c (t), we have C (f) < 0. For T =A/\B , we have 

t: 22 = «2 V b 2 , t \2 = ai V ^ 2 , A 1 = 02 \/b\. 
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It follows that 


tl2 V to\ = a\ Vi>2 Vfl2 Vfci = t22, 


whence, by Lemma 1231 c (t) > 0 and therefore C ( t ) > 0. 


□ 


2.2.2 Two Parallel Processes in an Arbitrary SP Network 

Consider now a composition SP(A,B,...) with ( A,B ,...) <-P (a, (3,0). 

Lemma 2.6. IfA.B in SPiA.B....) are parallel, then SP(A,fi,...) can be presented as A' /\B' if they are min-parallel, 
or as A' V B' if they are max-parallel, so that (A 1 ,B r ) <-P (a, (3) and, for any fixed R = r, the Prolongation Assumption 
holds. 

Proof By Definitions II. 2l and l 1.41 if A,B are min-parallel, then SP/\(A,B,.. .) can be presented either as 

SP^A^OASP 2 ^,...) 


or 


(SP 1 (A,...) A SP 2 (.B,...) + A) AT, 


or else 


(SP 1 (A,...) A SP 2 (B,...) A A) +7, 

where B does not enter in S P 1 and A does not enter in S P 2 . On renaming 

SP 1 (A,...)aSP 2 (B,...), 


=B' 


(SP 1 (A,...)ASP 2 (B,...)+A)A7 = (SP 1 (A,...)+A)a(SP 2 (B,...)+A)AF, 


=b' 


and 


(SP 1 (A,.. .) A SP 2 (fi,...) A A) + 7 = (SP 1 (A,...) + Y) A (SP 2 (fi,...) A A + Y ), 


=A' =B' 

we have, obviously, ( A',B') <-P (a, (3). Fixing R = r, by the (nonstrict) monotonicity of SP compositions, 

a\ = SP 1 (ai,...) < SP 1 (« 2 ,...) =a' 2 


and 

/7 , 1 =SP 2 (/7 1 ,...)<SP 2 (/72,...)=^2 

We can also put a\ = SP 1 {a\ ,-) < SP 2 (Z?i,...) = b\ (otherwise we can rename the variables). The proof for the 
max-parallel case is analogous. □ 

Theorem 2.7. IfA,B in SPIA.B.. ..) are min-parallel, then c (t) < 0 for any r,t; consequently, C (t) < 0 for any t. If 
A,B in SP(A.B.. . .) are max-parallel, then c (/) > 0 for any r,t; consequently, C ( t ) > 0 for any t. 

Proof. Immediately follows from Lemma f2~6l and Theorem l275l □ 


2.3 Sequential Processes 

2.3.1 Simple Serial Architectures of Size 2 

Simple serial architectures of size 2 corresponds to the SP composition T =A+B, with (A.B) <-P (a, (3). Recall the 
definitions of the two cumulative interaction contrasts: (I27l >- (l28l > and (I29l >-(l3()b. 
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Theorem 2.8. IfT =A+B, then c (Of) >0 and c (t,°°) < 0 for any rf; moreover, 

limc(0,f) = limc(/,°°) = 0, 
r-> o 

for any r,t. Consequently, C (0,f) > 0, C (t,°°) < 0 for any t, and 

limC(0,f) = limC = 0 

r->°° f->0 


Proof Follows immediately from Lemma IZ4l since 

—t\l +02 + ^21 — t22 = — (fll + & l) + (fll + ^2) + (^2 + ^l) — (^2 + ^2) = 0. 


□ 


2.3.2 Two Sequential Processes in an Arbitrary SP Network 

Consider now a composition SP(A,B,...) with ( A,B ,...) <-P (a, (3.0). 

Theorem 2.9. If A and B are sequential in an SP(A .B....) composition, then one or both of the following statements 
hold: 

(i) c (0,/) > 0 for any r,t, and C (0,/) > 0 for any t, 

(ii) c(t,°°) < 0 for any rf, and C(t ,°°) < 0 for any t. 

Proof In accordance with Definitions ll.21 and ll.4l SP(A,B,...) with sequential ,4. B can be presented as either 

(SP l (A,...) + SP 2 (B,...))AX + Y (32) 

or 

(SP 1 (A,...) + SP 2 (S,...))VA + 7 (33) 

(note that any Z in SP 1 (A,...) + SP 2 (fi,...) +Z can be absorbed by either of the first two summands). For both cases, 
by the monotonicity of S P compositions, for any R = r, S P 1 (at, ■ • •) < SP 1 (a 2 , ...), SP 2 (b u . ..) < SP 2 (fe 2 , • • •), and 
we can always assume SP 1 (a\ .-■)< SP 2 (£>i, ...). Denoting the durations of SP 1 (a,-,...) + SP 2 (bj,.. .) by t-j, we have 
therefore, by Theorem 12. 8 1 — t' n + tf + tf — tf = 0. Denoting the durations of X and Y by t' and t", respectively, in 
the case (|32] > we have 

ti^t'ijAt' + t". 

By Lemma l2Al all we have to show is that — 1 11 + 1 12 + /21 — ?22 > 0. It is easy to see that t" does not affect this linear 
combination, and its value is (assuming t[ 2 < t ' 2l , without loss of generality) 


0 



if t' < t' n 



—f ll 

+ t' 


if In 

<t' 

< 

t'n 

-t'n 

+ f 12 


if Il2 

<t' 

< 

t'n 

-t'n 

+ t'n 

+ ?2l — ^ 

if t'n 

<t' 

< 

t > 

'22 

-t'n 

+ t'n 

Jrt 2\~ t 22 

if t' > f 22 




The nonnegativity of the first three expressions is obvious, the fifth one is zero, and the forth expression is larger than 
the fifth because t' < t 22 . 

The proof for the case ( |33] > is analogous. □ 

If the SP composition with sequential A, B is homogeneous (Definition ! 1.51) . the statement of theorem can be made 
more specific. 

Theorem 2.10. If A and B are sequential in an SPa(A,B, ...) composition, then c (0 ,t) > 0 for any r.t, and C (Of) > 0 
for any t; if the composition is SP v(A,B,...), then c (f,°°) < 0 for any rf, and C (t,°°) < 0 for any t. 
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3 Multiple Processes 

We now turn to networks containing n >2 processes with durations (X 1 .... .X"), selectively influenced by factors 
(A, 1 ,... ,A"). In other words, we deal with compositions SP(X 1 ,... ,X n ,...) such that (X 1 ,... ,X n ,...) <-P (A 1 ,... ,A",0), 
where each if is binary, with values 1,2. There are 2" allowable treatments and 2" corresponding overall durations, 
7ii...i, 7) | 2 - ■ • •, 7x2... 2 - According to Definition ! 1.6l of selective influences, each process duration here is a function of 
some random variable R and of the corresponding factor, X k = x k (Rftf). For any fixed value R = r, these durations 
are fixed numbers for any given treatment, and so is the overall, observed value of the SP composition. We denote 
them 


and 


\rX = l)=4r,* Jc (rX = 2)= X * r 


(34) 


where 21 , 22 ,.. 


nA 


n^i2n 


• • • X n ri •••)>••• U\i 


i n £ {1,2}. The distribution function for is a shifted Heaviside function 


(35) 


hili 2 ...i„r (t) — 


0 , if t < t'nij-. inr 
lj if t f ti£l 2 ...i„r 


Denoting by . (t) the distribution function of we have 


Hhh-in 



hili 2 —i n r (t) d/2,-. 


(36) 


(37) 


Conditioned on R = r, the n-th order interaction contrast is defined in terms of mixed finite differences as 


cl (t ) — A;, A i 2 . . . Ai n hi x i 2 .,.i n r (t) , 

which, with some algebra can be shown to be equal to 

c!" l (,)= £ (-i)«+a., 

h,hr-7n 

Thus, 

(t)=Ai l hi l r(t)=hi r (t)-h 2 r(t) = J^(-i) 1+,1 /li 1 r(0» 

h 

Cr (0 = ^i2^hh r (0 = [^11 r (0 ^12 r {t )] [^21r (0 ^22 r {t )] 

= hnr(t)-hnr{t)-h21r{t)+h 2 2r{t)= I (-1 ) 2+, ' 1+ ^ h hhr (t) , 

hh 

cl (f) = Ai l Aj 2 Ai 3 hi l i 2 i i r (t) 

= {[h\Ur(t) —hnirit)] — [hmr(t) — hlllrit)]} - {[hnirit) —h2\2r{t)\ — \h22lr{t) -Il222r(t)]} 
= h\l\ r (t) — h\\2r ( t) —hn\ r {t) — h2Ur{t) + h\22r (0 + ^212r {t) + h.221r {t) — fl222r (0 

= E (_l ) 3 + h + 2 2 + 23 ^.. 2(yW) 

n,'2,'3 

etc. The observable distribution function interaction contrast of order n is defined as 


(38) 

(39) 

(40) 

(41) 

(42) 


C w (0= [ cl n) {t)d/j r . 
J'K. 


(43) 


By straightforward calculus this can be written in extenso as 

C«(0= £ (_i)»+E=i ik H h ... in (t), (44) 

— , 2/1 


or, in terms of finite differences. 
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C^(t) = A h A h ...A in H hh ... in (t). (45) 

This is essentially the high-order interaction contrast used by Yang, Fific, and Townsend (2013), the only difference 
being that they use survivor functions 1 —H(t) rather than the distribution functions H (t). We see that c r (t) and C(t) 
in the preceding analysis correspond to c| 2) (t) and C 1 ' 2 ' 1 (t), respectively. 

We also introduce n-th order cumulative contrasts. Conditioned on R = r, we define 

cP 1 (0,0 = cP 1 (t ,o°) = h\ r (0 - h 2r (0 , (46) 


etc. Generalizing, 


nt r°° 

Cp 1 (0,0 = J C^(tl)dtl, c| 21 (f,°°) = J | c| 2) (tl)dtl, 

c| 31 (0,0 = [ [ c?\t 2 )dt 2 dti, cP 1 (f,°°) = [ [ cf\t 2 )dt 2 dt]_, 
Jo Jo Jt Jti 

cl" 1 (0,0 = J (^J ■ J c^ (t„-i)dt n -i...dt 2 S j dti, 

cl" 1 (f,o°) = f if ... f c < f\t n -i)dt n -i...dt 2 ] dti. 

Jt \Jti Jt„_ 2 J 


(47) 

(48) 

(49) 

(50) 


The corresponding unconditional cumulative contrasts of the n-th order are, as always, defined by integration of the 
conditional ones: 


C 1 " 1 (0,0 = [ cl” 1 (0,0d/Jr = f ( [' ... f" 2 C^(t n - 1 )dt n - 1 ...dt 2 )dt l , 
J%. Jo \J 0 Jo J 

C 1 " 1 (f,oo) = f cl" 1 (t , °°) d/r r = f if ... f C (n) (t n -i )dt n - i...Jl 2 ] dt\. 

J J t ^ *n —2 J 


(51) 


(52) 


In the proofs below we will make use of the recursive representation of the conditional cumulative contrasts c|- 1 . 
It is verified by straightforward calculus. Denoting 


„(«-!) 


J (0 = 


E (_i iw i 


l w l w+\ ...tnrVJi 


( 0 , 


(53) 


*1 1 i l w +1 T--i l n 

where w £ and i w is fixed at 1 or 2, we have: 


cl* 1 (0,0 = cl 11 (t,oo) = hlr (0 - h 2r (0 , 


(54) 


cp 1 (0,0 = f c| 2) (x)c/x 

Jo 

= / {h\\r (x) h\ 2r (x) h 2 \ r (x) T h 22r (x)) dx 
Jo 


to L 


c|pLi, r (o-4L r (o 


dx 


( 55 ) 


-J o cJ.Li ir (°,x)rfx-jT c| 3 L 2r (0,x)Jx, 
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and generally, for n > 1, 


cl 21 


/ oo 

c? ] (x)rfx 

= f {hllr (x) - hl 2 r (x) - h 2 lr (x) + h 2 2 r (x)) dX 

= f KLi,(x)-c(;L v (x)]r/x 
= f C !l ] = C !1U ,(T,oo)rfT, 


c[ 31 (0,/)= [ f cf ] (t 2 )dt 2 dt\ 


/0 JO 

r 


<L 1 , r (f2)-<L v (f 2 ) 


dt 2 dt\ 


If 

f f c^l l r (t 2 )dt 2 - f cfj =2 r (t 2 )dt 2 

f c[ S=l,rM dT ~ f c £ ] =2,r (0,T)rfT, 


c/fi 


Cl 


[3] 


c 2=l,r ( ? 2 )-c| 2 L 2 , (f 2 ) 


dt 2 dt\ 


( t,°°) = f f cf’\t 2 )dt 2 dh 

-Iff] 

= [ f C t=l,r^ dt 2~ [ C 2=2,r( ? 2)<fe 

Jt \_Jt\ Jt 1 

II 


ft\ 

= l t c| 2 Li ir (x,oo)dx-jT cj 2 L 2r (x,°o)Jx, 


rffl 


c "" ] ^ 0, ^ = i) c t=U °^ dx -! 0 c ‘Ja°^ 

c [ " ] (t ’ °°) = f c !" = llr ( T > °°) ^ - f C [ "~Jl r (x, °o) dX. 

Also we have, by substitution of variables under integral, 

c l"7 1] (0,f) = ci"~ 1] (0, 

c t 1] = c [ "“ 1] (f-x”; r ,oo). 

The Prolongation Assumption generalizing (ITTIi-(fT2l) is formulated as follows. 


(56) 


(57) 


(58) 


(59) 


(60) 


(61) 


(62) 


Prolongation Assumption. R and functions x 1 ,... ,x" in (l34l > can be chosen so that x\ r < x 2r for all R = r and for all 
k = 1 Without loss of generality, we can also assume x\ r < x\ r < ... < x\ r (if not, rearrange x \ r ,... ,x" r ). 

Notation Convention. As we did before for n = 2, when r is fixed throughout a discussion, we omit this argument 
and write x \,... , t hh ... in , h hh ... in (t), c {n \t) in place of x \ r ,... ,x” r , t ; y 2 .../„ r , h hh ... in r (t), cf\t). 
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3.1 Parallel Processes 

3.1.1 Simple Parallel Architectures of Size n 

Theorem 3.1. IfT = X 1 A ... AX", then for any r,t, c 1 "' 1 (t) < 0 if n is even and (t) > 0 ifn is odd. Consequently, 

for any t, C M ( t) < 0 ifn is even and C-”- 1 ( t ) > 0 ifn is odd. 

Proof. By induction on n, the case n = 1 being true by the Prolongation Assumption: 

c (1 ) (:?) = hi ( t ) —112 (: t ) > 0. 

Let the statement of the theorem be true for (?) , with n — 1 > 1. By the Prolongation Assumption, 

hi 2 ...i n =x}Ax? A...AX? =x\, 

for any L... i„, whence 


h\i 2 ...i n 



0, if t < x\ 
l,ift>x\ 


Therefore cj" = j * (?) 


= 0, and, applying the induction hypothesis to c 


(n~ 1) 
< 1=2 


( 0 , 


C«(?) 


C <i = l 


(?)-C 


(«-l) 

< 1=2 


(o= -ct" 


' 1=2 



< 0, if n is even 
> 0, if n is odd 


That C 1 ”- 1 (?) < 0 if n is even and C ( "j (?) > 0 if « is odd follows by the standard argument. 

Theorem 3.2. IfT = X 1 V.. .MX", then for any r,t, c ^ (?) > 0. Consequently, for any t, C^ n> (?) > 0. 
Proof. By induction on n, the case n = 1 being true by the Prolongation Assumption: 

(?) = h\ (?) — /i2 (?) > 0. 

Let the theorem be true for c^” _1 ^(?), where n — 1 > 1. Let 

4v« V...VX? =x7, 


where 1 <m <n. We have then 


tilh—im- l2l m +l --<n _ '2 ) 

and 

h • ... . (?) - / 0,ift<x% 

for all i\...i m -\,i m+ \...i n . Then c-" = ^ (?) = 0, and 


t >*2 ’ 


c W (0 = c£:l ) (?)-c^ ) (?)=c£:l ) (?)>0. 

Consequently, C (, 'j (?) > 0, for any ?. 


□ 


□ 


3.1.2 Multiple Parallel Processes in Arbitrary SP Networks 

In a composition SP (X 1 ,... ,X n ,.. .), the components X 1 ,... ,X" are considered parallel if any two of them are par¬ 
allel. We assume selective influences (X 1 ,... ,X", .. .) <-p (A, 1 ,... ,X" .&). We do not consider the complex situation 
when some of the selectively influenced processes X 1 ,... ,X" are min-parallel and some are max-parallel. However, if 
they are all (pairwise) min-parallel or all max-parallel, we have essentially the same situation as with a simple parallel 
arrangement of n durations. 
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Lemma 3.3. IfX 1 ,... ,X" are all min-parallel or max-parallel in an SP composition, this composition can be pre¬ 
sented as T = A 1 A . .. AA" or T = A 1 V...VA", respectively. In either case, (A 1 ,... ,A") <-P (A, 1 ,..., X") and the 
Prolongation Assumption holds for any R = r. 

Proof. For the min-parallel case, by a minor modification of the proof of Lemma 12761 we present the SP composition 
as 


SP 1 (X 1 ,...)ASP 2 (X 2 ,...,X",...), 

s. y ✓ 

—A 1 

or 

(SP 1 (X 1 ,...) ASP 2 (X 2 ,... ,X n ,...) +X) AT = (SP 1 ^ 1 ,...) +X) A (SP 2 (X 2 ,... ,X n ,...) +X) AT, 

—A 1 

or else 


(SP 1 (X 1 ,...)ASP 2 (X 2 ,...,X",...)AX)+T = (SP l {X\...)+Y) A (SP 2 (X 2 ,... ,X",...) AX+ Y). 

—A 1 


Then we analogously decompose SP 2 (A 2 ,... ,X n ,...) achieving A 1 A A 2 A SP 3 (X 3 ,... ,X n ,...), and proceed in 
this fashion until we reach the required A 1 A... A A". The pattern of selective influences is seen immediately, and the 
Prolongation Assumption follows by the monotonicity of the SP compositions. The proof for the max-parallel case is 
analogous. □ 

Theorem 3.4. IfX 1 ,... ,X n are min-parallel in an SP composition, then for any r,t, c^ n - (t) < 0 if n is even and 
cW (t) > 0 if n is odd. Consequently, for any t, C W ( t) < 0 if n is even and C*”-* (f) > 0 if n is odd. IfX 1 ,... ,X" are 
max-parallel, then for any r,t, c 1 "-* (/) > 0, and for any t, (A") (/) > 0. 

Proof. Follows from Lemma l3.3l and Theorems l3.1l and l3.2l □ 

3.2 Sequential Processes 

3.2.1 Simple Serial Architectures of Size n 

Theorem 3.5. IfT = X 1 +... +X", then for any r,t, (0 ,t) > 0, while (f ,oo) < 0 if n is even and > 0 

if n is odd; moreover, (0,°°) = 0 for any r. Consequently, for any t, (0,t) > 0, while Cl"l ( t ,°°) <0 if n is even 
and Cl n 1 (f,°°) > 0 if n is odd; moreover, Cl n l (0,°°) = 0. 

Proof. By induction on n , the case n = 1 being true by the Prolongation Assumption: 

c« (0 ,t) =c [1] (t,° o) =h\ (t) —h 2 {t) > 0, 


and 

limcW (0 ,t) = liinc^ (f,°°) = 0. 

t-A°° t-A 0 

Let the statement of the theorem hold for all natural numbers up to and including n— 1 > 1. Using the recursive 
representations (EH)-®, 

cl"] (0,0 = Jo cfz}} (0,x) dx -/o (0,0 dx 

= fo ^ c[ " -11 (0, x) dx — ^ cl n ~ l l (0, x) dx , (63) 

= f t -$cl n ~ 1 H 0 ,x)dx 

which is > 0 since (0,x) > 0 and t — x™ < t —xf Analogously, 

cl"] (t,o o) = /“ c!"! 1 ! 1 (t, oo) dx - f t °° cf~J (x, oo) dx 

= ft-xy c[n ~ 1] ( T > °°) dT - ft-x!!’ c[n ~ 11 ( X > °°) dx 

= d"-' ] (x,oo)dx 


( 64 ) 
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which is < 0 if n is even and > 0 if n is odd. Applying the mean value theorem to the results of ( 1631 and (l64l >. we get, 
for some t — x™ < t', t" < t — x* 


rt~x” 

/ C [' , - 1 ](0,x) C /T = c["' 1 ] (0/) (~xY+j&), 

J t—x 2 

rt-x\ 

/ (T,oo)dx = cl”- 1 ] (f",oo) (-X? +X%) , 

Jt-xy 


and, as cl” '1 (0,°°) = 0, both expressions tend to zero as, respectively, t —> °° (implying t' —> °°) and t —>0 (implying 


0 ). 


□ 


3.2.2 Multiple Sequential Processes in Arbitrary SP Networks 

In a composition S P (A 1 ,..., X n ,...), the components X 1 ,..., X n are considered sequential if any two of them are 
sequential. By analogy with Theorem l2.9l for two sequential processes and with Theorem [64] for parallel X 1 ,...,A", 
one might expect that the result for the simple sequential arrangement A 1 + ... + X" will also extend to n sequential 
components of more complex compositions SP (A 1 ,... ,A”,...). However, this is not the case, as one can see from 
the following counterexample. 

Consider the composition 

SP(A 1 ,A 2 ,A 3 ,T) = (A 1 +A 2 +A 3 )A(T = 2), (65) 

with (A 1 , A 2 , A 3 ) selectively influenced by binary factors, so that 


x 

x 


l 

l 

1 

2 



( 66 ) 


It follows that 

fill = 0 , 

*112 =*121 =*211 = 1, (67) 

*122 = *212 = *221 = *222 = 2 . 

This is clearly a sequential arrangement of the three durations A 1 , A 2 , A 3 , but one can easily check that cl 3 l (O.t) here 
is not nonnegative for all t. For instance, at t=3 we have, by direct computation, cl 3 l (O.t) = —1. We conclude that 
there is no straightforward generalization of Theorems 13 .5 1 to arbitrary SP compositions. 


4 Conclusion 

The work presented in this paper is summarized in the abstract. By proving and generalizing most of the known results 
on the interaction contrast of distribution functions, we have demonstrated a new way of dealing with SP mental 
architectures. It is based on conditioning all hypothetical components of a network on a fixed value of a common 
source of randomness R (the “hidden variable” of the contextuality analysis in quantum theory), which renders these 
components deterministic quantities, and then treating these deterministic quantities as random variables with shifted 
Heaviside distribution functions. 

The potential advantage of this method can be seen in the fact that the shifted Heaviside functions have the simplest 
possible arithmetic among distribution functions: for every time moment it only involves 0’s and l’s. As a result, the 
complexity of this arithmetic does not increase with nonlinearity of the relations involved. Thus, Dzhafarov and 
Schweickert (1995), Cortese and Dzhafarov (1996), and Dzhafarov and Cortese (1996) argued that composition rules 
for mental architectures need not be confined to +, A, V. They analyzed architectures involving other associative and 
commutative operations, such as multiplication. Due to mathematical complexity, however, this work was confined to 
networks consisting of two components that are either stochastically independent or monotone functions of each other. 
It remains to be seen whether the approach presented here, mutatis mutandis , will lead to significant generalizations 
in this line of work. 

The limitations of the approach, however, are already apparent. Thus, we were not able to achieve any progress 
over known results in applying it to Wheatstone bridges (Schweickert & Giorgini, 1999; Dzhafarov et ah, 2004). 
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The possibility that the “architecture” (composition rule) itself changes as one changes experimental factors makes 
the perspective of a general theory based on our approach even more problematic (e.g., Townsend & Fific, 2004). It 
seems, however, that these problems are not specific for just our approach. 
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